Fill target errors with nans for evaluate #41919
base: main
Add pyrit and not remove the other one
Pull Request Overview
This PR enhances the target application step by filling any rows that fail execution with NaN values and logging a warning when such failures occur.
- Adds a warning log when some target executions fail.
- Reindexes the output to align with all input rows, filling missing ones with NaN.
- Adjusts the concatenation order to ensure equal-length DataFrames.
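The behaviour summarised above can be illustrated with a minimal sketch. It is not the code from `_evaluate.py`: the helper name `fill_failed_rows`, the column names, and the assumption that `target_output` is indexed by the line number of the input row it came from are all hypothetical, and the concatenation order shown is just one that keeps both frames the same length.

```python
import logging

import pandas as pd

logger = logging.getLogger(__name__)


def fill_failed_rows(input_df: pd.DataFrame, target_output: pd.DataFrame) -> pd.DataFrame:
    """Hypothetical helper: align target output with every input row.

    Assumes target_output only contains rows whose target call succeeded,
    indexed by the same labels as input_df.
    """
    complete_index = input_df.index
    # Rows present in the input but absent from the output are the failures.
    failed_lines = len(complete_index.difference(target_output.index))
    if failed_lines > 0:
        logger.warning(
            "Target function failed for %d of %d lines; their outputs are filled with NaN.",
            failed_lines,
            len(complete_index),
        )
    # reindex inserts an all-NaN row for every missing label.
    target_output = target_output.reindex(complete_index)
    # Both frames now share the same index, so a column-wise concat is aligned.
    return pd.concat([input_df, target_output], axis=1)
```

For example, with three input rows and a target that only succeeded on rows 0 and 2, the returned frame still has three rows and the `response` value for row 1 is NaN.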
Comments suppressed due to low confidence (2)
sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluate/_evaluate.py:620
- The function's docstring should be updated to mention that rows failing target execution are reindexed and filled with NaN, and that a warning is logged when failures occur.
if failed_lines > 0:
sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluate/_evaluate.py:638
- No unit test currently covers the scenario where some rows fail and are filled with NaN. Consider adding a test that simulates partial failures, asserts NaN in the missing rows, and verifies the warning is logged.
target_output = target_output.reindex(complete_index)
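A test along the lines suggested in the second comment might look like the sketch below. It exercises a hypothetical stand-in, `reindex_target_output`, not the real evaluate code path; the logger name `evaluate_sketch` and the `response` column are likewise assumptions made for illustration.

```python
import logging
import unittest

import pandas as pd

logger = logging.getLogger("evaluate_sketch")  # hypothetical logger name


def reindex_target_output(target_output: pd.DataFrame, total_lines: int) -> pd.DataFrame:
    """Hypothetical stand-in for the step under test."""
    failed_lines = total_lines - len(target_output)
    if failed_lines > 0:
        logger.warning("%d target run(s) failed", failed_lines)
    # Missing line numbers become all-NaN rows.
    return target_output.reindex(pd.RangeIndex(total_lines))


class PartialFailureTest(unittest.TestCase):
    def test_failed_rows_become_nan_and_warn(self):
        # Simulate a run where line 1 failed and produced no output.
        partial = pd.DataFrame({"response": ["a", "c"]}, index=[0, 2])
        with self.assertLogs("evaluate_sketch", level="WARNING") as captured:
            result = reindex_target_output(partial, total_lines=3)
        self.assertEqual(len(result), 3)
        self.assertTrue(pd.isna(result.loc[1, "response"]))
        self.assertEqual(len(captured.records), 1)
```

`assertLogs` fails the test if no warning is emitted, so the same test covers both the NaN fill and the logging requirement.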
Description
Please add an informative description that covers the changes made by the pull request and link all relevant issues.
If an SDK is being regenerated based on a new swagger spec, a link to the pull request containing these swagger spec changes has been included above.
All SDK Contribution checklist:
General Guidelines and Best Practices
Testing Guidelines